A Machine Learning Approach to Portuguese Pronoun Resolution
نویسندگان
چکیده
Anaphora resolution is an essential component of most NLP applications, from text understanding to Machine Translation. In this work we discuss a supervised machine learning approach to the problem, focusing on instances of anaphora ubiquitously found in a corpus of Brazilian Portuguese texts, namely, third-person pronominal references. Although still limited to a subset of the more general co-reference resolution problem, our present results are comparable to existing work in the field in both English and Portuguese languages, representing the highest accuracy rates that we are aware of in (Brazilian) Portuguese pronoun resolution.
منابع مشابه
Using 'Low-cost' Learning Features for Pronoun Resolution
We investigate a machine learning approach to Portuguese pronoun resolution. We presently focus on so-called ‘low-cost’ learning features readily obtainable from the output of a part-of-speech tagger, and we largely bypass deep syntactic and semantic analysis. Preliminary results show significant improvement in resolution precision and recall, and are comparable to existing rule-based approache...
متن کاملUsing ‘Low-cost’ Learning Features for Pronoun Resolution
We investigate a machine learning approach to Portuguese pronoun resolution. We presently focus on so-called ‘low-cost’ learning features readily obtainable from the output of a part-of-speech tagger, and we largely bypass deep syntactic and semantic analysis. Preliminary results show significant improvement in resolution precision and recall, and are comparable to existing rule-based approache...
متن کاملA Machine Learning Approach to Pronoun Resolution in Spoken Dialogue
We apply a decision tree based approach to pronoun resolution in spoken dialogue. Our system deals with pronouns with NPand non-NP-antecedents. We present a set of features designed for pronoun resolution in spoken dialogue and determine the most promising features. We evaluate the system on twenty Switchboard dialogues and show that it compares well to Byron’s (2002) manually tuned system.
متن کاملSupervised Ranking for Pronoun Resolution: Some Recent Improvements
A recently-proposed machine learning approach to reference resolution — the twin-candidate approach — has been shown to be more promising than the traditional single-candidate approach. This paper presents a pronoun interpretation system that extends the twin-candidate framework by (1) equipping it with the ability to identify non-referential pronouns, (2) training different models for handling...
متن کاملDisambiguation of the Neuter Pronoun and Its Effect on Pronominal Coreference Resolution
Coreference resolution, determining the appropriate discourse referent for an anaphoric expression, is an essential but difficult task in natural language processing. It has been observed that an important source of errors in machine-learning based approaches to this task, is the wrong disambiguation of the third person singular neuter pronoun as either referential or non-referential. In this p...
متن کامل